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Abstract 

The combination of data from long-baseline and reactor oscillation experiments 
leads to a preference of the leptonic CP phase (5cp in the range between tt and 27r. We 
study the statistical significance of this hint by performing a Monte Carlo simulation 
of the relevant data. We find that the distribution of the standard test statistic used to 
derive confidence intervals for (5cp is highly non-Gaussian and depends on the unknown 
true values of 623 and the neutrino mass ordering. Values of (5cp around it/2 are 
disfavored at between 2a and 3a, depending on the unknown true values of 623 and the 
mass ordering. Typically the standard approximation leads to over-coverage of the 
confidence intervals for Sqp- For the 2-dimensional confidence region in the (hcp,023) 
plane the usual approximation is better justihed. The 2-dimensional region does 
not include the value (5cp = ^/2 up to the 86.3% (89.2%) CL assuming a true normal 
(inverted) mass ordering. Furthermore, we study the sensitivity to 5cp aud 623 of 
an increased exposure of the T2K experiment, roughly a factor 12 larger than the 
current exposure and including also anti-neutrino data. Also in this case deviations 
from Gaussianity may be signihcant, especially if the mass ordering is unknown. 
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1 Introduction 

Thanks to a beautiful series of neutrino oscillation experiments [1-15] we have now a clear 
picture of the mixing pattern in the lepton sector. Neutrino oscillations depend on two 
neutrino mass-squared differences Am^i, Am 32 (with = mj — rn^-), three mixing angles, 
6 'i 25 ^ 23 , ^ 13 , and one complex phase 5cp, where we adopt the standard parameterization for 
the leptonic mixing matrix [16]. Out of those six parameters the three mixing angels and the 
two mass-squared differences are well determined by global data [17-19], up to the sign of 
Am \2 which parametrizes two possible orderings of the neutrino mass states, normal ordering 
(NO) versus inverted ordering (10). 

One of the ultimate goals of neutrino oscillation physics is to determine the complex phase 
5cp- Values of ^cp different from zero and tt imply CP violation in the lepton sector [20-22], 
see refs. [23,24] for reviews. Determining (5cp and possibly establishing CP violation with 
reasonable precision is a formidable task which most likely will require high-intensity neutrino 
beams beyond the current generation of experiments. Nevertheless, already with current 
experiments, some first hints on a preferred range of 5cp may be obtained at a modest 
confidence level, see for instance [25-29] for estimates. Indeed, currently available global 
data seem to indicate a slight preference for the range tt < (5cp < 27r compared to 0 < (5cp < 
TT [17-19]. This hint emerges mostly from the combination of the —)■ observation from 
the long-baseline experiments T2K [13] and MINOS [ 6 ] with the determination of the mixing 
angle 613 by reactor experiments DayaBay [12], RENO [10], and DoubleChooz [9]. 

Assessing the significance of this hint is a non-trivial task. Standard statistical tools 
usually employed in global fits of neutrino oscillation data are likely to fail for the determi¬ 
nation of 5cp. The reason is that several conditions for Wilks theorem [30] are violated in 
this case: statistics is low, the sensitivity of the data to the parameter is rather poor, and 
predictions depend non-linearly on the parameter (via trigonometric functions). In the con¬ 
text of present data those issues have been commented on in ref. [17], based on preliminary 
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simulations of the statistical properties of the used statistics. Similar considerations for 
future experiments can be found in refs. [31,32]. 

In this work we extend the results of ref. [17] and study in detail the distribution of 
the relevant test statistics by generating large samples of pseudo data and constructing 
conhdence intervals or regions with the correct coverage following the Feldman-Cousins pre¬ 
scription [33]. We study the behavior as a function of the unknown true values of 5 cP)^ 23, 
and the neutrino mass ordering. In refs. [31,32] the sensitivity to CP violation has been stud¬ 
ied, whereas in this work we concentrate on the related but different problem of constructing 
conhdence intervals for ^cp- In addition to analyzing present data, we also investigate the 
behavior of the test statistics assuming an increased exposure of the T2K experiment, to be 
expected in the timescale of several years. We attempt to provide an explanation of our nu¬ 
merical results by considering the non-linear structure of the relevant oscillation probabilities 
including parameter degeneracies. 

The outline of the paper is as follows. In section 2 we briehy describe the data from the 
T2K and MINOS experiments, introduce the relevant test statistics, and discuss the statis¬ 
tical analysis based on the Monte Carlo simulations. In section 3 we consider the relevant 
oscillation probabilities and provide a discussion about why deviations from Gaussianity can 
be expected. Our results analyzing present data are presented in section 4. We discuss the 
distributions of the 1-dimensional Ax^ statistics for ^cp and 623 , hnding large non-Gaussian 
behavior, especially for dcp- Then we construct the 2-dimensional regions in the (5 cP)^ 23) 
plane and hnd them to be much closer to the Gaussian approximation. In section 5 we 
investigate how this situation will change, once more data become available. We study the 
sensitivity of T2K by increasing the exposure by roughly a factor 12 compared to the present 
one including also anti-neutrino data. We hnd even in that situation deviations from the 
Gaussian approximation remain signihcant in certain regions of the parameter space. We 
summarize and conclude in section 6. In the appendix we show the impact of hrst data on 
anti-neutrinos from T2K [34], which appeared after the completion of this work. 


2 Description of data and statistical analysis 

In this work we use the data from the long-baseline experiments T2K and MINOS, both from 
the appearance and disappearance channels, including also a small anti-neutrino data sample 
from MINOS, see Tab. 1 for details and references.^ Our code departs from the re-analysis 
of the data developed in the context of the NuFit collaboration and used in ref. [17]. For 
each of the six data samples shown in Tab. 1 we perform a spectral ht, where the numbers 
of spectral bins are given in the table. Our predictions of the event spectra (0) have been 
calibrated in order to reproduce the expected spectra provided by the collaborations. Here 
r runs over the six data samples, i labels the energy bins, and 0 collectively denotes the 
oscillation parameters. Each data set is described by a statistics appropriate for Poisson 

^Preliminarry results from a T2K anti-neutrino run have been released recently at EPS HEP 2015 [34], 
consisting of 3 events in the appearance channel. We comment on the impact of these data in the appendix. 
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Experiment 

Channel 

Exposure (p.o.t.) 

Ref. 

Data points 

Events 

T2K 

-t t'/. 

6.57 X 10^“ 

[14] 

16 

120 

T2K 


6.57 X lO^o 

[13] 

5 

28 

MINOS 

Vf, Vf, 

10.71 X lO^o 

[7] 

39 

2782 

MINOS 

i'll -t i'll 

3.36 X 10^0 

[7] 

14 

222 

MINOS 

t'/x 

10.6 X lO^o 

[ 6 ] 

5 

88 

MINOS 

-t W 

3.3 X lO^o 

[ 6 ] 

5 

9 


Table 1: Summary of used data. The last two columns give the number of bins used to fit the energy 
spectrum, and the total number of observed events, respectively. Recent T2K data on the —>■ Vf, channel 
[34] are not used in the main text, but we show some results including them in the appendix. 
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where is the observed number of events, and is the systematic over-all normalization 
error included via the pull parameters a^. Since those data are largely statistics domi¬ 
nated, this simple treatment of systematic errors suffice. For each experiment our analysis 
is validated by checking that when analyzing the data in the same way we can reproduce 
the conhdence regions in parameter space obtained by the experimental collaborations with 
good accuracy. When combining the data samples given in Tab. 1 we simply add the 
functions, 

y(e) = 5^y(e), (3) 

r 

ignoring possible correlated systematic errors between the data sets. The results from the 
reactor experiments [9,10,12] are taken into account implicitly by hxing ^13 = 8.5°. 

Below we are going to focus on the parameters hcP) ^ 23 , and Am 32 , which currently have 
the largest uncertainties, including the sign of Am‘^ 2 - The other oscillation parameters are 
hxed to 9 i2 = 33.5°, ^13 = 8.5°, Am^i = 7.5 x 10“®eV^. Those parameters are known within 
better than 15% at 3a ^ and we expect that hxing those parameters has only a small impact 
on our results. Hence, in the notation used above we have 0 = {hcP; ^ 23 , Am 32 }. 

If we are interested in conhdence regions of one of the parameters 0 = {5cp, 6 * 23 , Am 32 }, 
irrespective of the others, we consider the following test statistic. Taking for example 5 cp; 
we dehne 

Ax^(5cp) = min x2(0)-x%^, (4) 

where Xmin global minimum of the with respect to all parameters 0. Note that 

when minimizing over Am| 2 , we always take into account both signs, i.e. we minimize also 
over the two mass orderings. Similar dehnitions apply for the other 1-dimensional cases, 

^Here we define the precision by 2(a;“P — where and are the upper and lower 

ends of the 3cr intervals [17], respectively. 
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Ax^( 6 ' 23 ) and Ax^(Am 32 ). The 2-dimensional confidence regions are based on an analogous 
definition, e.g., 

Ax^(^cp, ^ 23 ) = niin x^(0) - xLn ■ (5) 

This procedure is equivalent to the profile-likelihood method to treat nuisance parameters. 

Wilks theorem [30] implies that under certain conditions, the test statistics from eqs. (4) 
and (5) are distributed according to the ^^-distribution with 1 and 2 degrees of freedom 
(dof), respectively. This is the basis of the standard method to derive confidence regions for 
the parameters, using the condition Ay^ < f^ 2 (CL, dof). We refer to as “cut levels”, and 
their values can be obtained by integrating the corresponding distribution. For instance, 
the 1-dimensional intervals at la, 2a, 3a are derived by Ay^ < 1,4, 9, respectively.^ We will 
refer to this situation as the “Gaussian limit” or the limit” in the following. 

Wilks theorem applies if the theoretical predictions Tj(0) span a linear space when 0 
is varied. For instance, this is the case if Tj(0) can be expanded to linear order: Tj(0) 

Ai + BiQ. This is trivially fulfilled for a linear model, where this relation is exact. For 
non-linear models Tj(0), the linear approximation will hold in the vicinity of the best fit 
point and will be reliable up to a certain CL, beyond which the non-linear character of the 
parameter dependence can lead to deviations from the Gaussian limit. For “powerful” data, 
which constrain the parameter efficiently, the linear approximation will hold up to a high CL, 
whereas for “weak” data with poor sensitivity to the parameter it will break down already at 
low CL. Deviations from Gaussianity are expected for example close to a physical boundary 
of a parameter, or when certain values of the predictions Tj(0) cannot be reached due to 
the parameter dependence of the model, for instance via trigonometric functions, as we are 
going to see below. 

In order to study deviations from the Gaussian limit we have performed Monte Carlo 
(MG) simulations and calculated the distributions of the test statistics numerically. For 
assumed true values of the parameters 0 *™® we generate artificial pseudo data by assuming 
a Poisson distribution for the observables with mean T”(0*''“®). Those data are multiplied 
by a random Gaussian number with mean 1 and standard deviation al^^ in order to take 
into account the systematic normalization uncertainty. In this approach the origin of the 
systematic uncertainty is related to some auxiliary measurements, determining for instance 
the fiducial volume of the detector or the beam normalization. The measured values of those 
experiments are used to determine the theoretical predictions. Hence, the normalization of 
the predictions is subject to statistical fluctuations of the auxiliary measurements. We set 
the unknown true value of the normalization constant to 1 and generate random realisations 
of this number with standard deviation a” , which then enters the “observables” for the MG 
generated data. This implies that we consider the auxiliary measurements as part of the 
experiment, also hypothetically to be repeated many times. However, we have checked that 
our results do not depend on whether we randomize the systematic error or not. 

Then the test statistics eqs. (4) or (5) are calculated at the point e.g., Ay^(^jQj(5cp), 
where the subscript MG[0] indicates that the pseudo data from the Monte Garlo generated 
at the point 0 are used in the y^. We perform this calculation times for each point in the 
0 true which provides us the true distribution of the test statistic for each parameter 

^In this paper we use the two-sided Gaussian convention to convert standard deviations into CL, which 
implies that Icr, 2a, 3cr correspond to 68.27%, 95.45%, 99.73% CL, respectively. 
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value. From those histograms we can obtain the true cut levels tMc(CL, 0) for a given CL 
a, by demanding that a fraction a of all pseudo experiments fulhlls 

^Xmc[0](^cp) < ^mc(CL, 0). (6) 

Then the correct conhdence intervals (or regions) for the parameters are obtained by those 
values of the parameters for which the test statistics of the real data fulhlls Ax^(5cp) < 
^mc(CL,0). Those intervals have the correct coverage by construction and follow the pre¬ 
scription of Feldman and Cousins [33]. 

Here we used the test statistic for (5cp as an example, but analogous expressions hold for 
the other 1-dimensional as well as 2-dimensional test statistics. Note that in the left side 
of eq. ( 6 ), is evaluated at (5cp corresponding to the same value as used to generate the 
pseudo data. However, although the test statistic for given data depends only on ^cp, fhe 
MC results do depend also on the other parameters 623 and Am‘^ 2 - Hence, the conhdence 
intervals for ^cp niay depend on the unknown true values of the other parameters, an ehect 
we will indeed observe in our numerical studies presented in the next section. 


3 Discussion of oscillation probabilities 


In the case of interest, we are facing a complicated parameter dependence of the predictions. 
We review here the relevant oscillation probabilities through which the parameters enter the 
event rate predictions. Note that we do not use the approximate expression for the numerical 
work (which is based on numerical calculations of the full three-havor probabilities including 
the matter ehect) but they serve well for a qualitative understanding. 

Let us dehne 


A = 


2 I 

311 


AE 


A = 


2EV 


Amii 


(7) 


where L is the baseline, E is the neutrino energy, and V is the ehective matter potential. 
For the z/„ disappearance channel we have 


Fdis ~ sin^ 2023 sin^ A , 


( 8 ) 


where we show only the leading term and neglect corrections due to A 21 as well as ^ 13 . An 
approximate expression for the —)■ oscillation probability, valid for a constant matter 
density is given by [35,36]: 


p A 


app 


4 sin 9 i 3 sin 623 


2 sin^ A (1 — asA) 


(1 — asAy 


+ s 


Arrio 


21 


I Am 


sin 26*13 sin 26*12 sin 26*23 cos(sA 4-a5cp) 


sin AA sin A(1 — asA) 


A 


1 — asA 


(9) 


The signs a and s describe the ehects of CP-conjugation and the neutrino mass ordering, 
respectively, with a = -|-1 for neutrinos and a = —1 for anti-neutrinos, and s = sgn(Am|]^). 
The matter ehect enters via the parameter A. Numerically one hnds for a matter density of 
3 g/cw? 


A ~ 0.094 



I Am 


311 


2.4 X 10-3 eV^ 



( 10 ) 
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Hence, for the T2K experiment, with E ~ 0.7 GeV, the matter effect is of order 6% and 
we can expand eq. (9) also in A, keeping only terms up to first order in A. To simplify 
the expression further we assume the first oscillation maximum, A 7r/2, which is a good 
approximation for T2K. Introducing the definitions 

1 /\ T7T.^ T j 

sin^ 023 = - + d, C =—sin 2013 sin 2012 sin 2023 , (H) 

Z 4£/ 

eq. (9) becomes for neutrinos and anti-neutrinos 

Py 2sin^ 013(1 + 2d)(l -|- 2 sA) — C sindcp (1 + sA ), (12) 

Py Ri 2sin^ 013(1 + 20) (1 — 2 sA) -|- C sindcp (1 — sA). (13) 

Note that the magnitude of d is constrained by the —>■ disappearance channel, but 
we are left with the octant degeneracy for 023, described by a sign ambiguity of d, with 
0 < 0 (0 > 0) corresponding to the first (second) octant for 023. We are going to consider 
values in the range —0.1 < d < 0.1, within the currently Sa confidence interval. Hence, 
the free parameters in the problem are the continuous parameter 0 cp and the two signs of d 
and s, i.e. four discrete sign combinations. Note, however, that especially for current data 
this is an over-simplification, since the uncertainty on |0| is large. Numerically we have 
sin^0i3 0.022, C ^ 0.013 (with a very weak dependence on d for |0| < 0.1), and A ^ 0.06. 

Hence, all terms in eqs. (12) and (13) are of similar order, and both the octant [37] and the 

mass ordering [38] degeneracies will lead to changes in the predictions of similar size as 0 cp- 
Recent discussions of the 0 cp and 023 interplay can be found in refs. [29,39,40]. 

Given the parameter dependencies from eqs. (8) and (12, 13) we can expect deviations 
from the Gaussian limit for the following reasons. 

1. Present data show only weak sensitivity to 0 cp, he., the full range 0 < 0 cp < 27r is 
allowed at relatively low GL. This implies that the strong non-linearity of the trigono¬ 
metric dependence in eq. (12) comes into play. In particular, because of the sine de¬ 
pendence, only a finite change of Tj (0cp) can be achieved by varying 0 cp, in contrast 
to the unbounded variation of a linear model. This means that (5 cp effectively provides 
less than 1 dof, which implies that the true cut levels for a given GL will be lower than 
the ones corresponding to the Gaussian approximation. 

For future data with decreased statistical errors the fluctuations of the data may become 
of similar size as the compact region spanned in Tj upon varying 0 cP) which means that 
the coverage of Tj is actually more efficient than for the linear model and 0 cp thus 
provides more than 1 dof. If the exposure is further increased the fluctuations will 
become even smaller than the compact region in Tj such that the linear expansion 
becomes valid and we approach 1 dof. 

2. Disappearance data depend on sin^ 2023, which leads to the octant degeneracy for 023. 

However, due to the sin^ 023 factor in the appearance probability the degeneracy is 
not complete when —)■ Ug data are included. The two possible solutions for sin^ 023 

corresponding to the two signs of 0 imply more freedom than a single linear parameter. 
Hence the presence of the degeneracy leads to an increase of the effective dofs, which 
implies increased cut levels. 
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3. For values of 623 close to 7r/4 we are facing a physical boundary in the disappearance 
channel, since sin^ 2^23 < 1, which implies that predictions corresponding to sin^ 2^23 > 
1 cannot be reached by varying 623 . This leads to a decrease of the effective dof, and 
reduces the cut levels. 

4. Similarly, for 5 cp — ^/2 or 3tt/2 the sinffcp dependence in the appearance probabilities 
(12) and (13) imply a physical boundary due to | sin(5cp| < 1- For those values of 5cp the 
derivatives of the probabilities with respect to (5cp vanish. Hence, we expect decreased 
cut levels for those values, while for (5cp — 0 or tt the dependence of the appearance 
channel resembles approximately a linear model. The behavior around (5cp — 7r/2 
or 37r/2 is further complicated by the octant and mass ordering degeneracies, see the 
discussion related to eq. (14) below. 

In our numerical results presented below we will observe all of those effects, where some of 
them may occur simultaneously, leading to a complicated interplay of effects. Nevertheless, 
some general features can be understood qualitatively. For instance, considering the ^cp 
dependence of eqs. (12) and (13) plus the 4-fold degeneracy related to the signs of d and 
s, we hnd that there are minimal and maximal values for the oscillation probabilities (and 
hence for the event rates) given by the following combinations of the parameters: 


Dmax . 

d > 0 (2nd oct.), 

s = +1 (NO), 

(5cp — 37r/2 

Dmin . 

d < 0 (1st oct.), 

s = -l(IO), 

5cp = tt/2 

Dmax . 

d > 0 (2nd oct.), 

s = -l(IO), 

(5cp = vr/2 

Dmin . 

d < 0 (1st oct.), 

s = +1 (NO), 

(5cp = 37r/2 


If the true values of d, s, (5cp correspond to one of the combinations in eqs. (14) then we 
are located at a physical boundary for the event rates: there is no point in the parameter 
space which can provide a larger (or smaller) value of the probability. Statistical fluctuations 
leading to even larger (or smaller) event rates than predicted for those extreme parameter 
values cannot be accommodated by adjusting the model. This implies that the effective 
number of dof of the is reduced, i.e. lower cut levels. A related discussion can also be 
found in ref. [32]. 

4 Results for present data 

4.1 One-dimensional intervals for (5cp 

We start presenting the results of our simulations for the 1-dimensional Ay^ distributions 
dehned in eq. (4). In hgs. 1 and 2 we consider Ay^((5cp) for the CP phase (5cp and show the 
cut levels tuc for la, 2a, 3a. Fig. 1 uses T2K data only whereas £g. 2 uses all data given 
in Tab. 1, showing qualitatively similar results to the T2K-only case. We have checked that 
our T2K results are consistent with the ones shown by the T2K collaboration [15], in cases 
where comparison is possible. 

By comparing the curves for ^mc to the y^ approximation indicated by the dashed 
curves, we observe signihcant deviations for the Gaussian limit, with fMc(2cr) and tMc(3cT) 
being much lower than the corresponding t ^2 [17]. Furthermore we hnd large variations 
of the Ay^(5cp) distribution depending on 5cp itself, and on the assumed true values for 






Figure 1: The cut levels tMc(CL, 0) for Ax^(i5cp) from T2K data for Icr (red), 2cr (blue), 3cr (green). 
Dashed lines indicate the Gaussian approximation t^ 2 . Left, middle, right panels correspond to sin^ ^* 2 ™® = 
0.4,0.5,0.6, respectively. We take |Am§ 2 *™'*| = 2.4 x 10“^ eV^ and for the upper (lower) row we have 
assumed a true normal (inverted) mass ordering. The black solid curve shows Ay^(i5cp) using the observed 
data (same curve in all panels). 


sin^023 [17] and to a lesser extent also depending on the mass ordering. The various panels 
in figs. 1 and 2 correspond to different assumptions about and the true mass ordering. 
Note that those are the “true” values assumed for generating the pseudo data, while when 
fitting to the data we leave 623 and Am 32 free. Comparing figs. 1 and 2, we find that the 
addition of MINOS data makes some of the “dips” in the cut levels less sever, e.g. the ones 
for sin^ 623 ~ 0.4 and 5 ~ 7r/2 for both orderings. 

The behavior of the tMC curves can be understood from the discussion given in section 3. 
The reduction of tuc compared to the Gaussian limit follows from the poor sensitivity of 
the data to (5cp, which implies that the full range 0 < (5cp < 27r becomes accessible. Hence 
the trigonometric dependence becomes relevant, changing hcp provides less freedom than a 
linear parameter, and the effective number of dof becomes reduced. 

The appearance of the bumps, for instance for normal ordering and (sin^ 02™® = 0.4, hcp — 
37r/2) and (sin^ 02™® = 0.6, 5cp — 7r/2) can be understood by considering the 023 octant 
degeneracy. We show in fig. 3 the sensitivity of T2K in the (sin^ 623 , 6 cp) plane. We 
calculate so-called Asimov data, using the theoretical prediction for certain assumed true 
values without statistical fluctuations as “data”. This indicates the expected sensitivity for 
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Figure 2: Same as fig. 1 but using combined T2K and MINOS data. 

that particular set of true values for an average experiment. Regions in fig. 3 are derived 
based on the Gaussian approximation for Ax^(^cp, ^ 23 )- 

By comparing those results with the upper row of panels in £g. 1 we hnd a correlation 
with the ability to resolve the degeneracy: in cases with improved sensitivity to the octant 
(panels (b) and (h) in £g. 3) the cut levels are low (upper left panel for dcp — 7r/2 and upper 
right panel for (5cp — 37r/2 in £g. 1), whereas for cases where the degeneracy is strong for 
all values of (5cp (panels (d) and (f) in £g. 3) the cut levels are high (upper left panel for 
5cp ~ 37r/2 and upper right panel for (5cp — vr/2 in fig. 1). This shows that the presence 
of the degeneracy increases the effective number of dof for the distribution. We have 
conhrmed a similar correspondence also for the inverted ordering. Note that the overall 
sensitivity to the octant is very poor even in the cases shown in panels (d) and (f) in £g. 3, 
corresponding to Ay^ 0.5 for the wrong octant solution. However, for the distribution 
of Ay^((5cp) it is relevant that the degeneracy can be resolved for a signihcant range of (5cp 
values, while for the the cases corresponding to panels (d) and (f) in £g. 3 the degeneracy 
is present at below la for all values of ^cp- Those results can also be understood from the 
discussion related to the maximal and minimal values of the oscillation probability given in 
the hrst 2 lines of eq. (14). Fig. 1 shows low cut levels for those combinations of parameters, 
following from the physical boundary of the event rates. 

"*^The reduced cut levels for NO, sin^ 623 = 0.4, <5cp — seen in the left upper panel of fig. 1 do not 
follow from this argument. While d < 0 and Jcp — 7r/2 does correspond to a minimum of the oscillation 
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Figure 3: Sensitivity of T2K data in the (sin^ 6 * 23 , ^cp) plane based on Asimov data at la, 2a, 3a assuming 
sin^ ^ 2 ™® = 0.4 (upper row) and 0.6 (lower row) for different values of 5^™® as marked by the dots in the plots. 
We assume = 2.4 x 10“^ eV^ (NO). Colored areas correspond to the current exposure, whereas the 

black contour curves correspond to an exposure of 7.8x 10^^ p.o.t. neutrino data (about 12 times the current 
exposure). Regions are derived by assuming the Gaussian approximation. 


For the cut levels in figs. 1 and 2, we have always minimized with respect to both mass 
orderings. However, we have checked that assuming that the mass ordering was known (i.e., 
restricting to the true ordering in the fit) does not change the results qualitatively. This is 
different from the cut levels for increased exposure discussed in section 5, where we will find 
a significant effect of restricting the mass ordering. 

The exact confidence intervals for ^cp can be obtained by comparing the Ax^(5cp) from 
the observed data (shown as black curves in figs. 1 and 2) to the curves for the cut levels. The 
confidence interval at a given CL is obtained by those values of ^cp for which Ax^((5cp) < 
fMc(CL). We show the confidence intervals for (fcp at Icr, 2a, 3a in table 2 and fig. 4 for 
different assumptions about the true values of 623 and the mass ordering, and we compare 
them to the Gaussian approximation. Because of the dependence of ^mc ocl the values of 
the other parameters the confidence intervals for 5cp depend on those unknown true values. 
From fig. 4 we see that the la intervals are relatively stable and agree well with the Gaussian 
approximation. The variations are relatively large for the 2a interval. There is a large 
parameter dependence on the CL of rejection of (5cp — 7r/2. Using combined T2K+MINOS 
data this rejection ranges from around 3 ct for sin^ ^23 = 0.4 (both mass orderings) to only 2a 
for sin^ 623 = 0.6 (NO), see fig. 2. We obtain Ax^ = 3.37 at (5cp = 7r/2. Hence, the Gaussian 
approximation gives a rejection of ^cp = 7 r/2 at l.Sa. The dependence of the MG results on 

probability, apparently in that case changing the mass ordering from NO to IH does not provide enough 
freedom to overcome the physical boundary. 
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true sin^ 623 

true MO 

Icr 

2cr 

3cr 

0.4 

normal 

[3.20, 6.00] 

[0, 0.42] U [2.65, 27r] 

[0, 27r] 

0.5 

normal 

[3.21, 5.94] 

[0, 0.70] U [2.48, 27r] 

[0, 27r] 

0.6 

normal 

[3.16, 5.90] 

[0, 27r] 

[0, 27r] 

0.4 

inverted 

[3.11, 6.14] 

[0, 0.44] U [2.63, 27r] 

[0, 1.08] U [2.35, 27r] 

0.5 

inverted 

[3.15, 6.07] 

[0, 0.45] U [2.61, 27r] 

[0, 27r] 

0.6 

inverted 

[3.23, 5.92] 

[0, 0.55] U [2.59, 27r] 

[0, 27r] 

Gaussian limit 

[3.09, 6.20] 

[0, 27r] 

[0, 27r] 


Table 2: Confidence intervals at Icr, 2a, Scr for the CP phase dcp from combined T2K and MINOS data 
for different assumptions about the true values of 023 and the neutrino mass ordering. The last row shows 
the confidence intervals in the Gaussian approximation. 


NO - 0.4 
0.5 
0.6 
10-0.4 
0.5 
0.6 

Gaussian 

0 '7r/2 TT 37r/2 27r 




Figure 4: Graphical representation of the confidence intervals for dcp at Itr (red), 2 tT (blue), 3a (green), 
see table 2. The labels on the left side indicate the true value of sin^ 023 and the mass ordering. The bottom 
three bars correspond to the Gaussian approximation. 


the true values of other parameters is even more pronounced for T2K data only, see £g. 1. 
Note that the rejection of ^cp — 7 r /2 is stronger for T2K data-only and the signihcance 
decreases somewhat when MINOS data are included. The reason for the slight decrease of 
Ax^((5cp) when MINOS data are added to T2K is that MINOS appearance data prefer a 
somewhat smaller value of ^13 than T2K and hence the combination with the reactor result 
for 013 becomes somewhat less effective in constraining 5cp when MINOS is included. 

Within the frequentist framework there is no way to marginalize or average over the true 
values of nuisance parameters, since those are considered to be hxed constants of Nature. 
Hence the dependence of the results for (5cp on the unknown true values of other parameters 
(especially 623 ) introduces an unpleasant ambiguity. One possibility to deal with this situ¬ 
ation could be to present for each CL the largest conhdence interval for 5cp- Then the CL 
would be a lower bound on the true coverage of the interval, i.e., such an interval at the a 
CL will cover the true value with a probability of at least a. A problem with this approach 
is that one has to maximize the conhdence interval of (5cp with respect to the true value of 
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Figure 5: The cut levels tMc(CL, 0) for Ax^{023) for Icr (red), 2cr (blue), Str (green). Dashed lines indicate 
the Gaussian approximation . Top row: T2K disappearance only, middle row: T2K disappearance and ap¬ 
pearance, bottom row: combined T2K and MINOS data. The columns correspond to = 0,7r/2 , tt, 37 r/2 
from left to right, and we always assume a true normal mass ordering with Am ^2 — 2-4 x 10“^ eV^. The 
black solid curve shows Ax^(023) using the observed data (same curve in the 4 panels in each row). 


023 , and within a frequentist framework it is not possible to decide which range for 623 has 
to be considered.® This problem is solved by considering 2-dimensional conhdence regions in 
both parameters. Before presenting those in section 4.3 below, we proceed now by discussing 
the 1-dimensional intervals for 623 - 

4.2 One-dimensional intervals for 623 

The cut levels for Ay;^(023) obtained from our MC simulation are shown in £g. 5 for T2K 
disappearance data only (top row), for T2K disappearance and appearance data (middle 

similar discussion in the context of the mass ordering determination can be found in ref. [41]. 
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row), and T2K and MINOS combined (bottom row). The columns of panels correspond 
to different values of the CP phase (5cp used to generate the pseudo data. We make the 
following observations: 

1. For non-maximal values | sin^ 623 — 0.5| > 0.05, the test statistic Ax^{023) is distributed 
approximately as a with 1 dof, according to the Gaussian approximation. 

2. For maximal values sin^ 623 — 0.5 the cut levels are somewhat reduced compared to the 
Gaussian case. This is the manifestation of the physical boundary sin^ 2623 < 1, which 
reduces the freedom provided by the parameter 623 . 

3. Disappearance data only (top row) is insensitive to the true value of the phase (5cp. 

4. When appearance data are included we observe a slight increase of the cut levels 
compared to the Gaussian limit for certain combinations of 623 and namely for 

= 7 r/ 2 , sin^ 6*23 > 0.6 and for = 37 r/ 2 , sin^ 6*23 < 0.4. These are the same 
regions where we have noted also an increase in the cut levels for Ax^(5cp) in the 
previous section, corresponding to the cases of strong octant degeneracy, see panels (d) 
and (f) in £g. 3. 

5. The behavior of the cut levels is basically unchanged by adding MINOS data to T2K. 
However, the Ax^(sin^ 623) from the observed data (black curves in the plots) is slightly 
disfavoring maximal mixing. While the confidence intervals for sin^ ^23 based on the 
MG from T2K happen to be very similar to the approximation, some deviations 
from the Gaussian limit occur when MINOS data are added, since the observed Ay^ is 
pushed somewhat into regions where ^mc differs from t^ 2 . 

Fig. 5 corresponds to assuming a normal ordering for generating the MG data. Very 
similar results are obtained also for inverted ordering. We have also investigated the distri¬ 
bution of the 1 -dimensional Ay^ for Am 23 have found very good agreement with the 

Gaussian limit, independent of any other parameters for any combination of T2K and/or 
MINOS data. This reflects the very robust determination of Am‘^2 by the disappearance 
spectral data. 

4.3 Two-dimensional confidence regions 

As we have seen above, the distribution of the test statistic for 5cp depends significantly 
on the unknown true value of 623 . Hence, treating 623 as nuisance parameter leads to the 
unpleasant result that the confidence intervals for (5cp cannot be stated independently of the 
true value of 623- One way to deal with such a situation in a frequentist framework is to 
consider two-dimensional confidence regions of both parameters, keeping in mind that the 
interpretation of the results is different. 

We consider the test statistic Ax^((5cp, ^ 23 ) defined in eq. (5) and simulate the distribution 
of this statistic for a grid of true values in the (5 cP)^ 23) plane. Then in each point in this 
plane the observed value of Ax‘^{Scp, O23) can be compared to the distribution from the MG 
to decide whether this point is included in the confidence region at a given GL. The results 
of such an analysis are shown in fig. 6 for T2K only (left panel) and the T2K -|- MINOS 
combination (right panel). The thick solid curves indicate the confidence regions in the 
space of (5cp and sin^ ^23 at la, 2a, and 3a GL based on the MG simulation. They can be 
compared to the thin dotted curves, which indicate the regions obtained under the Gaussian 
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Figure 6 : Two-dimensional confidence regions at Icr (red), 2cr (blue), Scr (green) in the (sin^023, <5cp) 
plane for T2K (left panel) and T2K -|- MINOS (right panel). Solid curves correspond to the MC simulation, 
whereas dotted curves correspond to the Gaussian approximation. For the MC we assume a true normal 
mass ordering with = 2.4 x 10“^ eV^, while for the fit we minimize with respect to Am §2 including 

its sign. 

approximation, i.e. using the cut levels t ^2 obtained from the distribution for 2 dof. We 
observe that regions obtained from the MC are relatively close to the Gaussian limit. We 
conclude that the 2-dimensional test statistic has “better” statistical properties than the 
1-dimensional one, where the 623 dependence is prohled out. 

This result is further illustrated in £g. 7, which shows sections through the 2-dimensional 
distribution. Those results should not be confused with the ones shown in hgs. 1 and 2, where 
the is minimised with respect to 623 , whereas here we keep it fixed at the assumed true 
value (eq. (4) versus (5)). The MC curves in fig. 7 should be compared to the corresponding 
cut values for a distribution with 2 dof, indicated by the dashed lines in the hgure. We 
observe that the MC cut levels are close to the Gaussian limit. For sin^ 623 — 0.5 we observe 
somewhat smaller tuc values compared to the Gaussian ones, due to the physical boundary 
sin^26'23 < 1 (visible in the right panel in £g. 7). In all cases the variation of the MC cut 
levels with 5cp as well as with 623 is signihcantly reduced compared to the 1-dimensional 
case. We conclude that for present data, the Gaussian approximation to derive conhdence 
regions is more reliable in the (5cp ) ^ 23 ) plane, whereas 1-dimensional confidence intervals 
for the CP phase ^cp suffer from large deviations from Gaussianity. 

For hgs. 6 and 7 we have assumed a true normal mass ordering to generate the MC data. 
The corresponding plots for a true inverted ordering are very similar. We can use the 2- 
dimensional conhdence regions also to quantify the rejection of ^cp = vr/2 by looking for the 
largest CL for which the ( 5 cp, sin^ 6*23) conhdence regions do not contain ( 5 cp = ti/2. In this 
way we hnd from the MC calculation that combined T2K and MINOS data allow to reject 
(5cp = Ti/2 at the 81.8% (83.9%) CL assuming a true normal (inverted) mass ordering. We 
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T2K+MIN0S sin^6»23 =0.4 (NO) 



T2K+MINOS sin^023 =0-6 (NO) T2K+MINOS 7r/2 (NO) 




Figure 7: Sections through the two-dimensional Ax^(5cP: ^ 23 ) distribution for combined T2K and MINOS 
data at constant 6*23 with sin^ 023 = 0.4 (left panel) and sin^ 023 = 0.6 (middle panel), and at constant 
(5cp = 7 r /2 (right panel). Solid curves correspond to tMC for lo" (red), 2a (blue), Scr (green). Dashed lines 
indicate for a distribution with 2 dof. The black solid curves show Ax^((5cp, 023 ) using the observed 
data. 

have Ax^(5cp = 7 i'/ 2 ) = 3.37, which in the Gaussian approximation for 2 dof corresponds to 
the 81.5% CL. We note that the MC results for NO and 10 are very similar and close to the 
one obtained in the Gaussian limit. In contrast, the corresponding rejection conhdence levels 
based on 1 -dimensional conhdence intervals for (5cp vary strongly with the true mass ordering 
and true ^23 and differ signihcantly from the Gaussian limit (see discussion in section 4.1). 
Note that the interpretation of the rejection conhdence levels based on 1-dimensional or 
2 -dimensional conhdence regions is diherent. 

In Fig. 8 we show the 2-dimensionaI Feldman-Cousins conhdence regions in the plane 
of sin^ 023 and Am 32 for the combined T2K and MINOS data. We observe that they agree 
quite well with the standard Gaussian approximation. For generating the MC data we have 
assumed here (5cp = 37 r/ 2 , but the results are very similar for other true values of (5cp- 


5 Increased exposure and T2K anti-neutrino data 

Next we are going to discuss how this situation will change in the near future, for increased 
exposure in T2K or when data on anti-neutrinos become available. Following the T2K 
collaboration [42] we consider an exposure of 7.8 x 10^^ protons-on-target (p.o.t.), approx¬ 
imately a factor 12 larger than the current exposure. We consider two cases, either using 
all of this exposure for neutrino data or equally sharing the exposure between neutrino and 
anti-neutrino running. We depart from the code used for our present-data T2K analysis (see 
section 2 ), and scale the spectrum normalization such that we can reproduce the expected 
number of events given in tables 4 and 5 in [42]. This is a rough approximation, especially 
for the anti-neutrino case, where we ignore the (rather substantial) contribution from the 
neutrino component in the beam. Despite those simplihcations we can reproduce accurately 
the event spectra from £g. 2 in [42], as well as the sensitivity plots based on Asimov data 
given in ref. [42]. Note that total event numbers are still relatively small, 210 (260) events for 
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Figure 8 : Two-dimensional confidence regions at la (red), 2 a (blue), ia (green) in the ( 6 * 23 , Am| 2 ) plane 
for T2K -I- MINOS data. Solid curves correspond to the MC simulation, whereas dotted curves correspond 
to the Gaussian approximation. For the MC we assume a true value of Jcp = 37 r/ 2 , while for the ht we 
minimize with respect to i5cp. 

neutrinos and 49 (35) events for antineutrinos for (5cp = 0 ( 37 r/ 2 ), implying large statistical 
errors. Our approximate implementation suffices to study the expected statistical properties 
of the test statistics. A detailed and accurate sensitivity calculation is beyond the scope of 
this work. 

In fig. 9 we show the cut levels for Ax^((fcp) for different assumptions about the true 6^23 
and mass ordering, both for neutrino-only data and for combining neutrino and anti-neutrino 
data. We find that in both cases a significant dependence on the unknown true values of 
023 and the mass ordering remains. In particular, the significance of excluding values of 
5cp ~ 7 r /2 or 37 r /2 will vary quite strongly. The locations of the dips in the cut levels follow 
the pattern discussed in section 3. The regions of low cut levels for neutrino data-only visible 
for NO, sin^023 = 0.6, (5cp — 37 r /2 (upper right panel) and 10, sin^ 023 = 0.4, 6 cp — 
(lower left panel) correspond to the regions of maximal and minimal oscillation probability 
indicated in the first two lines of eq. (14). Also the dips in the middle panels of fig. 9 (for 
sin^023 = 0.5) follow this argument. 

We also observe from the figure that in many cases using anti-neutrino data has only a 
small impact on the distribution of the test statistics. The only exceptions are sin^ 023 = 0.4, 
NO (upper left) and sin^ 023 = 0.6, 10 (lower right). In those cases cut levels are raised close 
to the Gaussian limit for 0 < ^cp < vt or tt < ^cp A 27r, respectively. In the opposite regions 
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Figure 9: The cut levels tMc(CL, 0) for Ax^(^cp) for future T2K data of 7.8 x 10^^ p.o.t. for Icr (red), 
2a (blue), 3 ct (green). Solid curves assume 100% neutrino running, dashed curves are for 50% neutrino and 
anti-neutrino data, each. Left, middle, right panels correspond to sin^ 02™® = 0.4,0.5, 0.6, respectively. We 
take = 2.4 x 10“^ eV^ and for the upper (lower) row we have assumed a true normal (inverted) 

mass ordering. The mass ordering ambiguity is included in the minimisation. 

of (5cp for those cases cut levels remain low also when anti-neutrino data are added. For 
those regions, there is an octant degenerate solution which basically destroys any sensitivity 
to (5cp, see panel (d) in £g. 3, while adding information from anti-neutrinos signihcantly helps 
in resolving this degeneracy which increases the sensitivity to 5cp. Those regions correspond 
to the minimal and maximal values for the anti-neutrino oscillation probability indicated in 
the last two lines of eq. (14), explaining the good sensitivity of anti-neutrino data for those 
cases. 

In £g. 10 we show the same analysis as in £g. 9, except that we assume that the mass 
ordering is known (determined independently by some other experiment). Hence, we do not 
minimize with respect to the mass ordering, but restrict the £t to the assumed true ordering. 
We observe that the sign(Am| 2 ) degeneracy has a large impact on the distribution of the 
test statistics and cut levels become much closer to the Gaussian approximation.® This is 
different to the situation we hnd for current data discussed in section 4, where the mass 
ordering degeneracy has only a small impact on the distribution of the test statistics. For 

®This is an example where the presence of the degeneracy decreases the effective number of dof, contrary 
to the cases discussed previously, where the presence of a degeneracy increases the number of dof. 
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Figure 10: Same as figure 9, but assuming that the mass ordering is known, i.e. restricting the fit to the 
assumed true ordering. 

neutrino-only data, there are still a few cases of large deviations from Gaussianity, whereas 
the combined neutrino and anti-neutrino data lead to cut levels close to the limit (see 
dashed curves in £g. 10). We still observe small dips for ^cp — and 37r/2 (now pretty 
symmetric around 5cp = vr), which can be traced back to the sin(5cp dependence of the 
probabilities, leading to a reduction of the dof for those values of (5cp. Note also that in 
some cases we hnd now cut levels which are even higher than the Gaussian ones. A possible 
explanation for this behavior can be found in the discussion in section 3, see item 1. 


6 Discussion and conclusions 

We have studied in detail the information we can obtain on the leptonic GP phase (5cp from 
current data, focusing on the robustness of frequentist conhdence regions, by performing a 
Monte Garlo simulation of the data from the T2K and MINOS experiments. We attempt 
to quantify the current preference for (5cp — 3n/2 over 5cp — tt/2. We have focused on the 
interplay of the main unknown parameters, namely dcp, ^ 23 , and the neutrino mass ordering. 
Our hndings can be summarized as follows. 

The distribution of the test statistic used for 1-dimensional confidence intervals for 
^cp shows large deviations from the Gaussian limit. In particular, it strongly depends on the 
unknown true value of 623 . This introduces an ambiguity in the conhdence intervals for 5cp, 
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see fig. 4. While the la interval for (5cp is relatively stable and close to the Gaussian approx¬ 
imation, at higher conhdence level large variations occur. In particular, the CL with which 
values of (5cp ~ 7 r /2 are disfavored ranges from 2a to 3a, depending on 623 and the mass 
ordering. We can trace back the origin of those results to the complicated non-linear param¬ 
eter dependence of the relevant oscillation probabilities (trigonometric dependence of (5cp 
and 6 ' 23 -octant and mass ordering degeneracies), combined with the rather poor sensitivity 
of current data to ^cp- 

We conclude that one should not use the Gaussian approximation when making state¬ 
ments about ^cp based on the 1-dimensional test statistic. Typically the “true” con¬ 
hdence levels obtained from the Monte Carlo simulation lead to more restrictive conhdence 
intervals and to stronger rejections of values of (5cp around 7 r/ 2 . In this sense the use of the 
Gaussian approximation is conservative. We have shown that the Gaussian approximation 
is better justihed for 2 -dimensional conhdence regions in the plane of 5cp and sin^ 623 , see 
hg. 6 . In particular, the dependence of the Ay^ distribution on the true values of 5 cp and 
623 is much less severe in the 2-dimensional case. The 2-dimensional conhdence region in 
the ((5 cP)^ 23) plane for combined T2K and MINOS data does not include (5cp = n/2 up to 
the 81.8% (83.9%) CL assuming a true normal (inverted) mass ordering.^ Those values are 
close to the CL of 81.5% obtained under the Gaussian approximation. 

We have considered also the 1-dimensional conhdence intervals for 623 and Am‘^ 2 - 
623 we hnd approximately Gaussian behavior, with some deviations around maximal mixing, 
see hg. 5. This is a manifestation of the boundary sin^ 2623 A 1, which implies that the 
derivative of the event rates predicted for the disappearance channel with respect to 623 is 
zero for 623 = 45°. The test statistic for Am |2 has a distribution very close to the Gaussian 
limit, as well as the 2 -dimensional conhdence regions in the (sin^ 6 * 23 , Am| 2 ) plane. 

In section 5 we have studied the distribution of the 1-dimensional Ay^ test statistic for 
(5cp assuming an increased exposure for T2K of 7.8 x 10^^ protons-on-target, roughly a factor 
12 larger than current exposure, where we consider also the possibility of using half of this 
exposure for anti-neutrino running. We hnd that even in this case large deviations from the 
Gaussian behavior can be expected. Typically reduced cut levels for the Ay^ are obtained 
around either ^cp — ^/2 or 37 r/ 2 , depending on the unknown true value of 623 and the 
mass ordering. Close to Gaussian behavior is only obtained for neutrino plus anti-neutrino 
running and assuming that the neutrino mass ordering is known. 

Let us mention that in the global ht of all oscillation data also SuperKamiokande at¬ 
mospheric neutrino data contribute to the determination of 623 and to a small extent also 
of (5cp, see ref. [17] for a discussion. Ideally a combined MG simulation of long-baseline 
and atmospheric neutrino data should be performed, which however, is not feasible due to 
the numerical complexity of the atmospheric neutrino ht. Since atmospheric neutrino data 
play only a subleading role for 5cp we expect that the results presented here would not be 
modihed substantially by including atmospheric neutrinos. 

For the investigation of near term future data in section 5 we have not considered the 
NOvA experiment [43,44], from which data will become available during the next years. In 
general we expect improved behaviour of the relevant test statistics, since complementary 
data from NOvA may help to resolve some of the degeneracies (see, e.g. refs. [28,29] for recent 

^When first data [34] from T2K on 17^ —)■ Ve are included, the corresponding numbers are 86.3% 
(89.2%) CL, see appendix. 
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studies), which—as we have shown—play a crucial role for the deviations from Gaussianity. 
The same is true also for experiments aiming to determine the neutrino mass ordering, 
see ref. [41] for a discussion and references. An exhaustive investigation of the expected 
statistical properties of future data is beyond the scope of this work. Some discussion along 
those lines in the context of CP violation can be found in ref. [32]. It will be an interesting 
topic for future work to study sensitivities to 5cp of combined data from NOvA, T2K, and 
other upcoming experiments based on the true distributions of the relevant test statistics. 
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A Impact of first anti-neutrino data from T2K 

After completion and submission of this paper, the first anti-neutrino results from T2K were 
presented at the EPS HEP 2015 conference [34]. In this appendix we show the impact of 
those data on the determination of ^cp- 

The results presented in [34] corresponds to about 4 x 10^° p.o.t. in the anti-neutrino 
mode with 3 observed events in the appearance channel.^ The expected background (NC 
and other) is 1.17 events, while the predicted signal from —)■ W —)• induced events 

ranges from 2 to 4 (0.3 to 0.6) events, depending on (5cp and the mass ordering. We could 
reproduce the predicted number of events given in [34] with good accuracy, which allows us 
to include those data and combine it with the other data used in this work. We use a 
as given in eq. (2) with just one bin (only the total number of events is htted), taking into 
account the background expectation, as well as oscillated anti-neutrino and neutrino event 
predictions. 

Clearly the statistical significance of those results is poor, since 3 observed events are 
even consistent with the background only hypothesis of about 1.17 events (no oscillation 
induced events at all) at slightly more than la. Therefore we expect that those data will 
change Ay^ by about 1 unit. Nevertheless, anti-neutrinos carry complementary information 
to the neutrino data and therefore it may be interesting to investigate the impact of those 
results. 

Fig. 11 shows the impact of the anti-neutrino data on the 1-dimensional confidence inter¬ 
val for 5cp- The solid curves, both for the cut levels as well as for Ay^(5cp) are without T2K 
anti-neutrinos and are identical to fig. 2, while the dashed curves include the information 
from the T2K anti-neutrino events. Comparing the black solid and dashed curve we hnd 
that Ay^ for (5cp = 7r/2 is increased by about 0.75. However, also the cut levels are increased 
by a similar amount and hence the significance of rejecting (5cp = 7r/2 is hardly affected. 
For sin^ 623 = 0.4 the effect is most pronounced and in those cases the signihcance actually 
decreases, despite the increased Ay^. 

®We do not consider the disappearance channel here. 
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Figure 11: The cut levels tMc(CL, 0) for Ax^((5cp) from combined T2K and MINOS data for Icr (red), 2a 
(blue), 3 ct (green). Left, middle, right panels correspond to sin^ 02™® = 0.4, 0.5,0.6, respectively. We take 
I = 2.4 X 10“^ eV^ and for the upper (lower) row we have assumed a true normal (inverted) mass 

ordering. The black curves show Ax^((5cp) using the observed data (same curves in all panels). Solid curves 
correspond to the data given in tab. 1 (no T2K anti-neutrinos, same as in fig. 2), dashed curves include T2K 
anti-neutrino data. 

In fig. 12 we show the 2-dimensional conhdence region in the (5 cp, ^ 23 ) plane for T2K as 
well as T2K-I-MINOS data, including the T2K anti-neutrino events in both cases. This hgure 
should be compared with £g. 6, which shows the corresponding regions without T2K anti¬ 
neutrinos. As expected the difference is small, with the size of the conhdence region being 
slightly decreased due to the new data. Including T2K anti-neutrinos we hnd that combined 
T2K and MINOS data allow to reject 5 cp = 7r/2 at the 86.3% (89.2%) CL assuming a 
true normal (inverted) mass ordering, to be compared with 81.8% (83.9%) CL without T2K 
anti-neutrinos. For Ax^((5cp = ^/2) we hnd now a value of 4.27, which in the Gaussian 
approximation for 2 dof corresponds to the 88.2% CL. 
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